SMOTified-GAN for Class Imbalanced Pattern Classification Problems

نویسندگان

چکیده

Class imbalance in a dataset is major problem for classifiers that results poor prediction with high true positive rate (TPR) but low negative (TNR) majority training dataset. Generally, the pre-processing technique of oversampling minority class(es) are used to overcome this deficiency. Our focus on using hybridization Generative Adversarial Network (GAN) and Synthetic Minority Over-Sampling Technique (SMOTE) address class imbalanced problems. We propose novel two-phase approach involving knowledge transfer has synergy SMOTE GAN. The unrealistic or overgeneralized samples transformed into realistic distribution data by GAN where there not enough available process them itself effectively. named it SMOTified-GAN as works pre-sampled produced rather than randomly generating itself. experimental prove sample quality been improved variety tested benchmark datasets. Its performance up 9\% from next best algorithm F1-score measurements. time complexity also reasonable which around $O(N^2d^2T)$ sequential algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A bias correction function for classification performance assessment in two-class imbalanced problems

This paper introduces a framework that allows to mitigate the impact of class imbalance on most scalar performance measures when used to evaluate the behavior of classifiers. Formally, a correction function is defined with the aim of highlighting those classification results that present moderately higher prediction rates on the minority class. Besides, this function punishes those scenarios th...

متن کامل

On Mining Fuzzy Classification Rules for Imbalanced Data

Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...

متن کامل

A New Performance Evaluation Method for Two-Class Imbalanced Problems

In this paper, we introduce a new approach to evaluate and visualize the classifier performance in two-class imbalanced domains. This method defines a two-dimensional space by combining the geometric mean of class accuracies and a new metric that gives an indication of how balanced they are. A given point in this space represents a certain trade-off between those two measures, which will be exp...

متن کامل

A Review of Multi-Class Classification for Imbalanced Data

Prediction and correct voting is critical task in imbalance data multi-class classification. Accuracy and performance of multi-class depends on voting and prediction of new class data. Assigning of new class of imbalance data generate confusion and decrease the accuracy and performance of classifier. Various authors and research modified the multiclass classification approach such as one agains...

متن کامل

On Mining Fuzzy Classification Rules for Imbalanced Data

Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3158977